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Abstract.  We  examine  the  central  issnes  of  invertibility,  stability,  artifacts,  and  freqnency- 
domain  characteristics  in  the  constrnction  of  non-linear  analogs  of  the  wavelet  transform.  The 
lifting  framework  for  wavelet  constrnction  motivates  onr  analysis  and  provides  new  insight 
into  the  problem.  We  describe  a  new  type  of  non-linearity  for  nse  in  constrncting  non-linear 
transforms:  a  set  of  linear  predictors  that  are  chosen  adaptively  nsing  a  non-linear  selection 
fnnction.  We  also  describe  how  earlier  families  of  non-linear  hlter  banks  can  be  extended  throngh 
the  nse  of  prediction  fnnctions  operating  on  a  cansal  neighborhood.  We  present  preliminary 
resnlts  for  a  synthetic  test  image. 


1.  Introduction 

In  his  classic  treatise  on  the  workings  of  the  human  visual  system,  Marr  focuses  on  the 
importance  of  the  representation  of  information  for  various  cognitive  tasks  [9].  The  way  in 
which  information  is  represented  brings  out  certain  types  of  features  while  hiding  others.  Image 
compression  applications  also  rely  heavily  on  having  an  efficient  representation  of  image  data. 
Ideally  we  would  like  to  approximate  an  image  with  a  small  number  of  parameters.  The  wavelet 
transform  provides  such  an  efficient  representation.  Most  of  the  wavelet  coefficients  of  a  typical 
image  are  nearly  zero,  and  the  image  thus  is  well-approximated  with  a  small  number  of  large 
wavelet  coefficients. 

The  reason  for  the  efficiency  of  the  wavelet  representation  is  that  images  often  are  well  modeled 
as  a  set  of  locally  smooth  regions  separated  by  edges.  Within  these  smooth  regions,  hue-scale 
wavelet  coefficients  are  small,  and  coefficients  decay  rapidly  from  coarse  to  hue  scales.  In  the 
neighborhood  of  edges  wavelet  coefficients  decay  much  more  slowly,  but  because  of  the  local 
support  relatively  few  wavelet  coefficients  are  affected  by  edges. 

Still  the  large  wavelet  coefficients  near  edges  are  expensive  to  code.  Much  current  research 
focuses  on  enabling  wavelet  coders  to  exploit  the  structure  present  in  wavelet  coefficients  along 
edges.  Successful  coders  perform  some  form  of  conditioning  [11]  or  variance  prediction  [8]. 


Key  words  and  phrases,  image  coding,  nonlinear,  wavelet,  lifting. 

^Electrical  and  Gompnter  Engineering  Department,  Rice  University,  Honston,  TX. 
^Mathematics  Department,  Dartmonth  Gollege,  Dartmonth,  NH. 

‘Lncent  Technologies,  Bell  Laboratories,  Mnrray  Hill,  NJ. 

clayporl@rice.edn,  gdavis@cs. dartmonth. edn,  wim@bell-labs.com,  richb@rice.edn. 

1 


Report  Documentation  Page 

Form  Approved 

0MB  No.  0704-0188 

Public  reporting  burden  for  the  collection  of  information  is  estimated  to  average  1  hour  per  response,  including  the  time  for  reviewing  instructions,  searching  existing  data  sources,  gathering  and 
maintaining  the  data  needed,  and  completing  and  reviewing  the  collection  of  information.  Send  comments  regarding  this  burden  estimate  or  any  other  aspect  of  this  collection  of  information, 
including  suggestions  for  reducing  this  burden,  to  Washington  Headquarters  Services,  Directorate  for  Information  Operations  and  Reports,  1215  Jefferson  Davis  Highway,  Suite  1204,  Arlington 

VA  22202-4302.  Respondents  should  be  aware  that  notwithstanding  any  other  provision  of  law,  no  person  shall  be  subject  to  a  penalty  for  failing  to  comply  with  a  collection  of  information  if  it 
does  not  display  a  currently  valid  0MB  control  number. 

1.  REPORT  DATE 

NOV  1997  2.  REPORT  TYPE 

3.  DATES  COVERED 

00-00-1997  to  00-00-1997 

4.  TITLE  AND  SUBTITLE 

Nonlinear  Wavelet  Transforms  for  Image  Coding 

5a.  CONTRACT  NUMBER 

5b.  GRANT  NUMBER 

5c.  PROGRAM  ELEMENT  NUMBER 

6.  AUTHOR(S) 

5d.  PROJECT  NUMBER 

5e.  TASK  NUMBER 

5f.  WORK  UNIT  NUMBER 

7.  PERFORMING  ORGANIZATION  NAME(S)  AND  ADDRESS(ES) 

Rice  University, Electrical  and  Computer  Engineering  Department, 6100 
Main  Street,  MS-366, Houston, TX, 77005 

8.  PERFORMING  ORGANIZATION 

REPORT  NUMBER 

9.  SPONSORING/MONITORING  AGENCY  NAME(S)  AND  ADDRESS(ES) 

10.  SPONSOR/MONITOR’S  ACRONYM(S) 

11.  SPONSOR/MONITOR’S  REPORT 
NUMBER(S) 

12.  DISTRIBUTION/AVAILABILITY  STATEMENT 

Approved  for  public  release;  distribution  unlimited 

13.  SUPPLEMENTARY  NOTES 

14.  ABSTRACT 

see  report 

15.  SUBJECT  TERMS 

16.  SECURITY  CLASSIFICATION  OF:  17.  LIMITATION  OF 

_ _ _  ABSTRACT 

18.  NUMBER  19a.  NAME  OF 

OF  PAGES  RESPONSIBLE  PERSON 

a.  REPORT  b.  ABSTRACT  c.  THIS  PAGE  Same  aS 

unclassified  unclassified  unclassified  Report  (SAR) 

10 

Standard  Form  298  (Rev.  8-98} 

Prescribed  by  ANSI  Std  Z39-18 


In  this  paper  we  focus  on  improving  the  properties  of  the  wavelet  transform  rather  than 
the  encoder.  More  precisely,  we  try  to  build  adaptive  wavelet  transforms  that  result  in  fewer 
large  wavelet  coefficients.  Such  non-linear  wavelet  transforms  allow  for  more  flexibility  in  image 
representations.  Construction  of  non-linear  hlter  banks  is  a  straightforward  process  and  has  been 
discussed  in  [7]  and  [6].  Experiments  with  a  nonlinear  hlter  bank  for  image  coding  presented  in 
[4]  show  results  that  are  quite  promising.  The  key  open  question  in  the  use  of  these  non-linear 
constructions  is  one  of  design:  what  is  the  most  effective  way  to  utilize  the  additional  degrees 
of  freedom  obtained  from  relaxing  the  constraint  of  linearity? 

We  examine  the  central  issues  of  invertibility,  stability,  artifacts,  and  frequency- domain  char¬ 
acteristics  (to  the  extent  to  which  these  are  well-dehned)  in  the  construction  of  non-linear  ana¬ 
logues  of  the  wavelet  transform.  Our  analysis  is  motivated  by  the  new  perspective  provided  by 
the  lifting  framework  for  the  wavelet  transform.  We  describe  a  useful  new  type  of  non-linearity 
for  use  in  constructing  our  non-linear  transforms:  a  set  of  linear  predictors  that  are  chosen 
adaptively  using  a  non-linear  selection  function.  We  also  describe  how  the  family  of  non-linear 
hlter  banks  of  [7]  and  [6]  can  be  extended  through  the  use  of  prediction  functions  operating  on 
a  causal  neighborhood. 


2.  The  Lifting  Scheme 

Our  non-linear  transform  is  most  easily  described  in  terms  of  the  lifting  framework  [13,  12]. 
The  main  feature  of  lifting  is  that  it  provides  an  entirely  spatial  interpretation  of  the  wavelet 
transform,  as  opposed  to  the  more  traditional  Fourier  based  constructions.  This  spatial  in¬ 
terpretation  will  enable  us  to  construct  spatially  varying  and  adaptive  or  non-linear  wavelet 
transforms. 

Let  X[n\  be  a  signal.  Our  goal  is  to  obtain  a  more  efficient  representation  of  X[ra],  i.e.,  a 
representation  in  which  most  of  the  coefficients  are  near  zero.  We  hrst  partition  X[n\  into  its  even 
and  odd  polyphase  components  Xe[ra]  and  Xo[ra],  where  Xe[ra]  =  X[2ra]  and  Xo[ra]  =  X[2ra  -|-  1]. 

If  the  X[n\  correspond  to  the  samples  of  an  underlying  smooth  function,  then  the  even  and 
odd  polyphase  components  are  highly  correlated.  This  correlation  structure  is  typically  local 
and  thus  we  should  be  able  to  accurately  predict  a  coefficient  from  the  odd  polyphase  component 
based  on  nearby  coefficients  from  the  even  polyphase  component. 

Prediction:  In  the  interpolating  formulation  of  lifting,  we  hrst  predict  the  odd  polyphase 
coefficients  Xo[ra]  from  the  neighboring  even  coefficients  Xe[ra].  The  predictor  for  each  Xo[ra]  is 
a  linear  combination  of  neighboring  even  coefficients: 

P{Xe)[n\  =  'Y^piXe[n  -7  /]. 

l 

We  obtain  a  new  representation  of  the  X[n\  by  replacing  Xo[ra]  with  the  prediction  residual. 
This  leads  to  the  hrst  lifting  step: 


H[n]  =  Xo[n\-  P{X,)[n]. 
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(1) 


If  the  underlying  signal  is  locally  smooth,  the  prediction  residuals  H[n]  will  be  small.  Further¬ 
more,  the  new  representation  contains  the  same  information  as  the  original  signal  X[ra]:  given 
the  even  polyphase  Xe[ra]  and  the  prediction  residuals  H[n\,  we  can  recover  the  odd  polyphase 
coefficients  Xo[ra]  by  noting  that 

Xo[n\  =  H[n\  + P{X,)[n\. 

This  prediction  procedure  is  equivalent  to  applying  a  high-pass  hlter  to  X[n\  (hence  the 
notation  H[n\).  The  prediction  hlter  is  typically  designed  so  that  it  is  perfect  for  polynomials 
up  to  and  including  degree  iV  —  1.  In  wavelet  terminology,  the  synthesis  scaling  function  to 
which  the  prediction  hlter  gives  rise  can  reproduce  polynomials  of  degree  up  to  iV  —  1  and  the 
dual  wavelet  has  N  zero  moments. 

Update:  The  second  lifting  step  transforms  the  even  polyphase  coefficients  Xe[ra]  into  a  low- 
passed  and  subsampled  version  of  X[ra].  We  obtain  this  low-pass  hltering  version  by  updating 
Xe[ra]  with  a  linear  combination  of  the  prediction  residuals  H[n].  We  replace  Xe[ra]  with 

L[n]=  X,[n]  +  U(H)[n],  (2) 

where  U(H)  is  a  linear  combination  of  neighboring  H  values: 

U{H)[n]  =  Y^uiH[n+l]. 

l 

Because  each  lifting  step  is  always  invertible,  again  no  information  is  lost.  Given  L[n\  and  P[ra], 
we  have 


and  again 


Xe[n\  =  L[n]  -  U{H)[n] 


Xo[n]  =  H[n]  + P{X,)[n]. 

Note  that  H  and  L  are  at  half  rate,  and  thus  this  transform  corresponds  to  a  critically  sampled 
perfect  reconstruction  hlter  bank.  One  can  show  that  the  update  function  determines  the  prop¬ 
erties  of  the  primal  wavelet  and  dual  scaling  function.  In  particular,  if  the  update  hlter  is  one 
half  the  adjoint  of  the  predict  hlter  then  the  primal  wavelet  has  N  zero  moments  as  well  [12]. 

A  simple  example  of  lifting  is  the  construction  of  the  Deslauriers-Dubuc  family  of  wavelets 
from  a  single  prediction  step  followed  by  a  single  update  step.  The  following  prediction  and 
update  steps  are  equivalent  to  performing  a  single  stage  of  the  (4,4)  Deslauriers-Dubuc  wavelet 
transform: 


i7[ra]  —  Ao[ra]  —  (Xe[n  ~  1]  +  9Ae[n]  -|-  9Xe[n  +  1]  +  Xe[n  2])  /16  (3) 

L[n]  =  X,[n]  +  (H[n-2]  +  9H[n-l]  +  9H[n]  +  H[n+l])/32  (4) 

The  predict  step  cancels  cubic  polynomials  and  leaves  the  residual  in  the  high  pass  signal 
H[n].  The  update  step  results  in  a  low-passed  and  subsampled  version  of  X  being  placed  in 
L.  It  should  be  emphasized  that  lifting  is  a  general  construction  that  is  not  limited  to  the 
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Deslauriers-Dubuc  family.  Using  the  Euclidean  algorithm,  one  can  decompose  any  FIR  wavelet 
transform  into  a  sequence  of  prediction  and  update  steps  [3]. 

3.  Nonlinear  Lifting 

The  analysis  Liters  used  for  wavelet  compression  applications  typically  correspond  to  fourth 
order  polynomial  predictors  like  the  one  described  above.  Such  predictions  work  well  if  the 
underlying  signal  is  locally  smooth.  However,  these  predictions  break  down  when  the  signal  is 
not  locally  well-modeled  as  a  low-order  polynomial.  In  particular,  the  predictions  work  poorly 
near  edges  and  other  singularities. 

Our  goal  is  to  switch  between  different  predictors  based  on  the  local  properties  of  the  im¬ 
age.  This  makes  the  P  operator  data  dependent  and  thus  non-linear;  lifting  guarantees  that 
the  transform  remains  reversible.  Where  the  image  is  locally  smooth  we  use  higher  order  pre¬ 
dictors.  Near  edges  we  reduce  the  order  and  thus  the  length  of  the  predictor  as  to  avoid  using 
coefficients  across  edges.  Ideally  we  would  like  to  use  predictors  that  take  into  account  the  fact 
that  discontinuities  in  images  tend  to  occur  along  continuous  curves.  Such  an  adaptation  would 
allow  us  to  exploit  the  additional  spatial  structure  that  we  know  exists  in  edges. 

3.1.  Problems.  Adapting  the  predictor  makes  the  transform  non-linear;  thus  the  transform  of 
the  sum  of  two  images  is  no  longer  the  sum  of  each  of  the  transforms.  Consequently,  it  no  longer 
makes  sense  to  talk  about  the  concept  of  basis  functions,  which  relies  fundamentally  on  linear 
superposition.  We  thus  focus  on  the  properties  of  the  transform. 

There  are  two  problems  with  making  the  above  predict-update  lifting  non-linear: 

1.  We  need  a  coherent  interpretation  of  the  updated  coefficients.  After  the  first  iteration  of 
our  transform,  we  are  basing  all  of  our  predictions  on  updated  coefficients.  If  we  are  to 
make  effective  predictions  beyond  the  first  iteration  of  the  transform,  we  need  some  kind 
of  structure  in  the  update. 

Furthermore,  the  reason  why  wavelets  provide  efficient  representations  lies  in  their  space- 
frequency  localization  properties.  The  spatial  localization  comes  from  the  fact  that  the  filter 
operations  are  local;  this  immediately  carries  over  to  the  non-linear  case.  The  frequency  lo¬ 
calization  comes  from  the  interpretation  of  the  H  as  a  band-pass-filtered  and  downsampled 
version  of  the  signal  and  L  as  a  low-pass-filtered  and  downsampled  version  of  the  signal. 
Maintaining  this  in  the  non-linear  case  is  highly  non-trivial. 

Consider  again  the  example  (4).  While  it  is  easy  to  see  that  the  prediction  filter  P  leads 
to  a  high  pass  filter,  it  is  not  immediately  clear  that  the  update  U  leads  to  a  low  pass  filter. 
The  reason  is  that  the  lifting  structure  mandates  that  the  high  pass  coefficients  H  must 
be  reused  in  the  computation  of  L,  and  thus  L  depends  both  on  P  and  on  U.  By  carefully 
adjusting  the  update  U  to  the  prediction  P,  one  can  ensure  that  P  is  a  low-pass-filtered 
and  subsampled  version  of  the  original  signal.  In  the  example  U(H)  had  to  be  chosen  as 
(H[n  —  2]  -|-  9H[n  —  1]  -|-  9H[n]  -f  H[n  -f  l])/32.  While  it  is  known  how  to  adjust  U  for  a 
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spatially  varying,  but  linear  P  [13],  it  is  not  immediately  clear  how  to  build  a  non-linear 
U  adjusted  to  a  non-linear  P. 

2.  We  need  to  ensure  that  the  transform  is  stable.  Lossy  coding  schemes  introduce  errors  into 
the  transform  coefficients,  so  it  is  crucial  that  the  non-linearities  do  not  unduly  amplify 
these  errors.  Our  goal  is  to  use  a  high  order  predictor  in  smooth  regions  and  a  lower 
order  prediction  near  edges.  In  order  to  avoid  sending  side  information  on  which  predictor 
was  chosen  we  need  to  base  the  choice  only  on  the  Xe[ra].  However,  in  lossy  compression 
the  decoder  only  has  quantized  even  coefficients  Xe[ra]  rather  than  the  original  coefficients 
Xe[ra].  If  we  use  locally  adapted  Liters,  quantization  errors  in  coarse  scales  could  cascade 
across  scales  and  cause  a  series  of  incorrect  filter  choices  leading  to  serious  reconstruction 
errors.  The  problem  cannot  be  solved  by  synchronization,  i.e.,  having  the  encoder  make  its 
choice  of  predictor  based  on  quantized  even  values.  The  reason  is  that  the  reconstructed 
values  Xe[ra]  are  obtained  from  quantized  low-pass  values  L[n].  The  low  pass  signal  L[n]  is 
a  function  of  the  prediction  residual  signal  H[n],  which  in  turn  depends  on  what  filters  are 
chosen  for  prediction.  Hence  the  encoder  cannot  obtain  the  quantized  values  Xe[ra]  until 
it  selects  a  predictor,  and  it  cannot  select  a  predictor  without  obtaining  Xe[ra].  If  we  are 
to  employ  a  non-linear  lifting  procedure  for  lossy  coding,  it  is  essential  that  we  avoid  this 
stability  problem. 


3.2.  Solution.  We  propose  a  simple  modification  which  solves  both  problems:  switch  the  pre¬ 
dict  and  update  lifting  steps  in  the  wavelet  transform.  We  thus  first  update  the  even  samples 
based  on  the  odd  samples  yielding  the  low  pass  coefficients.  We  then  reuse  these  low  pass  co¬ 
efficients  to  predict  the  odd  samples,  which  gives  the  high  pass  coefficients.  We  use  a  linear 
update  filter  and  let  only  the  choice  of  predictor  depend  on  the  data.  Because  we  update  first 
and  the  transform  is  only  iterated  on  the  low  pass,  all  low  pass  coefficients  throughout  the  entire 
pyramid  linearly  depend  on  the  data  and  are  not  affected  by  the  non-linear  predictor.  Thus  the 
prediction  is  only  based  on  low  pass  coefficients  which  are  computed  as  in  the  classical  wavelet 
transform.  Furthermore,  if  we  perform  the  transform  backwards,  i.e.,  starting  the  prediction 
process  at  the  lowest  frequency  (coarsest)  subband  and  working  from  coarse  to  fine  scales,  we 
can  keep  the  encoder  and  decoder  perfectly  synchronized. 

The  question  remains  on  how  to  find  the  P  and  U  filters  even  in  the  linear  case.  One  idea  is 
to  simply  use  the  same  P  and  U  filters  from  the  Deslauriers-Dubuc  family  but  in  reversed  order; 
this  results  in  switching  the  analysis  and  synthesis  functions.  This,  however,  is  problematic  for 
coding  apphcations,  because  the  analysis  wavelets  in  the  Deslauriers-Dubuc  family  are  much 
less  smooth  than  the  synthesis  wavelets.  Since  reconstructed  images  are  built  up  from  synthesis 
wavelets,  these  non-smooth  building  blocks  would  lead  to  highly  visible  artifacts  in  the  recon¬ 
structed  image.  Furthermore,  trying  to  boost  the  smoothness  of  the  new  building  blocks  leads 
to  needlessly  long  filters  which  cause  ringing. 
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Instead  we  propose  a  solution  based  on  Donoho’s  average-interpolation  which  hts  into  the 
update-predict  form  of  lifting  [5,  14].  It  leads  to  the  (l,iV)  branch  of  the  Cohen-Daubechies- 
Feauveau  family  which  is  biorthogonal  to  the  box  function  [2].  Let  us  consider  a  simple  example. 
The  low  pass  coefficients  are  computed  using  a  Haar  hlter: 

L[n\  =  {x[2n\  -\-  x[2n  -\-  l])/2. 

The  high  pass  coefficients  are  the  residuals  of  a  prediction  of  the  odd  samples  based  on  the  L[n\. 
A  hrst  order  Haar  prediction  is 

H[n\  =  x[2n  +  1]  —  T[n], 

while  a  third  order  predictor,  i.e.,  one  that  is  exact  for  quadratics,  is  given  by 

H[n]  =  x[2n  +  1]  —  {  —  L[n  —  l]/8  -|-  L[n]  -\-  L[n  -\-  l]/8). 

Higher  order  predictors  can  be  build  in  a  straightforward  way.  The  smoothness  of  the  resulting 
scaling  functions  increases  with  the  order.  A  lower  bound  for  the  Holder  regularity  R{N)  as 
a  function  of  N  is  given  by  i?(3)  =  .678,  i?(5)  =  1.272,  i?(7)  =  1.826,  i?(9)  =  2.354,  and 
asymptotically  R{k)  ~  .2075iV  [5].  In  numerical  experiments  the  order  (1,7)  hlter  set  has  been 
found  to  yield  performance  approaching  that  of  the  7-9  hlter  set  that  is  more  commonly  used  in 
coding  applications. 


Figure  1.  Top  row:  analysis  (left)  and  synthesis  (right)  scaling  functions  for  the 
order  (1,7)  Cohen-Daubechies-Feauveau  hlter  used  in  our  experiments.  Bottom 
row:  analysis  and  synthesis  wavelets.  These  basis  functions  correspond  to  the 
update-predict  form  of  lifting. 
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Our  goal  is  to  use  non-linear  predictor  operators  P{L)  so  that 

H[n]  =  x[2n  -|-  1]  —  P{L)[n], 

obtained  by  switching  between  different  linear  predictors.  We  classify  coefficients  at  a  given 
scale  into  “edge”  coefficients  and  “non-edge”  coefficients  based  on  the  gradient  at  the  next 
coarser  scale.  For  a  region  containing  non-edge  coefficients,  we  use  the  7th  order  predictor. 
Near  edges  we  reduce  the  order  of  the  predictor  so  that  the  neighborhood  we  use  for  prediction 
never  overlaps  the  edge.  This  way  we  maintain  high  accuracy  away  from  edges,  and  avoid  large 
errors  in  the  presence  of  edges.  Figure  2  illustrates  the  process  of  selecting  these  predictors  near 
an  ideal  step  edge. 


1  3  5  7  7  7  7 

))<  X  O  XT3  X  0  X  0  X  CFX  O 


XeM 


Xo[n] 


X  0  X  0  X  GbX  OXT>  XT3  X  0 

7  7  7  7  5  3  1 


Figure  2.  Predictor  selection  at  an  ideal  step  edge.  Numbers  indicate  the  order 
of  the  predictors  used.  The  closer  to  the  edge,  the  lower  order  predictor  is  used. 

As  we  stressed  in  the  previous  section,  maintaining  synchronization  between  the  adaptations 
of  the  encoder  and  the  decoder  is  essential  for  a  stable  inversion.  Encoding  of  a  p-level  transform 
proceeds  as  follows:  we  hrst  compute  the  coarsest  scale  coefficients  of  the  transform  LP[n]  by 
iterating  the  linear  update  procedure  p  times.  We  then  quantize  LP[n]  to  LP[n]  and  send  them. 
Then  we  compute  the  high  pass  coefficients  H[n]  as 

H[n]  =  LP-^[2n  +  1]  -  P(2^)[ra], 

quantize  them  to  H[n]  and  send  them.  Both  encoder  and  decoder  now  need  the  quantized  values 
of  the  next  hner  scale  the  even  and  odd  components  are  respectively  computed  by  undoing 

the  prediction  and  updating  step,  but  now  based  on  the  quantized  values  T^[ra]: 

=  T^“^[2ra-|-1]  =  H^[n]  +  P(L^)[n]  (5) 

Ll-\n]  =  LP-\2n]  =  2LP[n]  -  LP-\2n  +  1] 

We  now  can  compute  the  high  pass  coefficients  on  the  next  hner  level.  By  basing  our  choice  of 
predictor  at  each  stage  on  the  quantized  values  L,  we  maintain  synchronization  between  encoder 
and  decoder. 
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3.3.  Further  Adaptivity.  Non-linear  lifting  also  allows  us  to  use  not  only  the  low-passed 
coefficients  for  prediction  of  Xo[ra],  but  also  other  odd  coefficients  in  a  causal  neighborhood  of 
Ao[ra].  Suppose  our  signal  X[n\  is  a  row  in  an  image.  We  would  predict  Xo[ra]  from  low  pass 
coefficients  on  its  left  and  right.  Suppose  there  is  a  vertical  step  edge  near  Xo[ra].  The  precise 
location  of  the  edge  cannot  be  determined  from  the  low  pass  coefficients  Xo[ra].  However,  if  we 
know  the  value  of  the  coefficients  from  the  row  immediately  above  Xo[ra],  we  can  determine  the 
orientation  and  strength  of  the  edge,  and  we  can  use  this  information  in  the  prediction  of  Xo[ra]. 

The  use  of  coefficients  from  a  causal  neighborhood  offers  signihcant  potential  reductions  in 
prediction  errors.  This  increased  flexibility  comes  at  the  price  of  decreased  stability.  Consider 
the  example  above  in  which  we  resolve  difficulties  in  predicting  the  location  of  a  vertical  edge 
in  a  row  of  coefficients  by  using  already  inverted  coefficients  in  the  row  above.  Such  a  scheme 
permits  a  quantization  error  in  one  row  to  propagate  along  a  vertical  edge  to  all  other  rows. 
We  can  prevent  such  propagation  by  again  using  a  DPCM-like  strategy  of  using  quantized  data 
from  the  causal  neighborhood  for  making  predictions  in  the  encoder  as  well  as  in  the  decoder. 

3.4.  Related  Work.  The  update-then-predict  lifting  scheme  we  describe  is  related  to  the  Lapla- 
cian  pyramid  of  Burt  and  Adelson  [1]  in  which  images  are  represented  as  a  series  of  prediction 
residuals,  and  the  predictors  are  not  constrained  to  being  linear.  The  Laplacian  pyramid  has 
the  disadvantage  that  it  expands  the  number  of  coefficients  in  the  image  being  transformed  by 
a  factor  of  4/3.  Lifting,  on  the  other  hand,  guarantees  a  critically  sampled  decomposition. 

Our  non-linear  lifting  framework  generalizes  the  ideas  of  de  Quieroz  et  al.  [4],  and  makes  clear 
the  relationship  between  the  non-linear  hlter  banks  described  by  these  authors  and  the  wavelet 
transform.  The  hlter  bank  described  in  [4]  uses  the  update  function  f7  =  0  and  a  non  separable 
prediction  function  P  that  returns  the  median  of  its  arguments.  This  hlter  bank  performs 
particularly  well  for  test  images  containing  sharp  edges,  such  as  the  cameraman  image  and 
images  of  text.  Perceptually  this  non-linear  transform  has  signihcant  advantages  near  edges, 
and  it  minimizes  problems  with  ringing  around  edges.  However,  the  transform  suffers  from 
speckling  artifacts  due  to  aliasing  of  high  frequency  noise  into  the  low  pass  subbands.  The  use 
of  a  better  update  function  has  the  potential  to  eliminate  this  speckling  while  maintaining  high 
quality  reconstruction  around  edges. 


4.  Preliminary  Results 

Figure  3  shows  the  result  of  a  simple  feasibility  test  of  an  edge-avoiding  adaptive  transform. 
We  demonstrate  the  ability  of  the  adaptive  transform  to  reduce  the  number  of  non-zero  coeffi¬ 
cients  by  computing  the  transform,  thresholding  the  transform  coefficients  with  a  large  threshold, 
and  inverting  the  result.  We  use  a  black  disk  on  a  grey  background  as  our  test  image,  as  this 
image  contains  edges  in  all  orientations.  The  hgure  on  the  left  is  obtained  by  performing  the 
wavelet  transform  using  the  (1,7)  Cohen-Daubechies-Feauveau  functions  shown  in  Figure  1.  The 
hgure  on  the  right  was  obtained  using  an  adaptive  transform. 
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Figure  3.  Reconstructed  unit  disks  with  threshold  T  =  1.  The  left  image  was 
transformed  with  the  (1,7)  update-predict  linear  lifting.  The  right  was  trans¬ 
formed  with  an  edge-avoiding  non-linear  lifting  procedure.  We  assume  that  the 
forward  and  inverse  transforms  make  the  same  edge  decisions. 


For  our  adaptive  transform  we  choose  a  filter  from  the  (1,  iV)  branch  of  the  CDF  family,  where 
N  G  {1,3,  5,  7}.  The  choice  of  predictor  is  based  on  an  edge  analysis  of  each  7-point  prediction 
window.  We  locate  edges  in  the  data  by  hnding  local  maxima  in  the  gradient  of  the  coefficients. 
We  estimate  the  size  of  an  edge  by  comparing  an  average  of  coefficients  on  either  side  of  the 
edge,  and  we  only  adapt  if  the  step  size  is  large  compared  to  the  local  variance  on  either  side  of 
the  edge.  The  prediction  hlter  is  chosen  as  illustrated  in  Figure  2:  we  choose  the  width  of  our 
prediction  window  to  be  the  largest  window  (up  to  width  7)  not  containing  an  edge. 

We  see  that  in  the  adaptive  transform  we  have  signihcantly  reduced  ringing  around  the  edges 
of  the  disk  and  we  have  preserved  edge  sharpness.  The  reason  is  that  edges  in  our  new  transform 
are  represented  in  a  more  compact  fashion,  and  as  a  result  there  is  less  degradation  of  the  image 
when  we  zero  out  small,  non-zero  coefficients.  Note  that  while  ringing  has  been  greatly  reduced 
in  the  horizontal  and  vertical  edges,  there  are  still  some  ringing  artifacts  in  the  diagonal  direction. 
The  reason  for  these  remaining  artifacts  is  that  we  are  using  a  separable  transform  in  which  we 
seek  to  avoid  horizontal  and  vertical  edges. 

One  future  research  avenue  that  will  allow  us  to  avoid  privileged  edge  directions  in  the  trans¬ 
form  is  to  make  use  of  a  non-separable  prediction  function.  We  can  further  improve  things  by 
predicting  values  along  edges  rather  than  across  edges  by  using  data  in  a  causal  neighborhood. 
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